Converting and Annotating Quantitative Data Tables
Abstract
Companies, governmental agencies and scientists produce large amounts of quantitative (research) data, consisting of measurements that range from the surface temperature of an ocean to the viscosity of a sample of mayonnaise. Such measurements are stored in tables in, for example, spreadsheet files and research reports. To integrate and reuse such data, a semantic description of the data is needed. However, the notation used is often ambiguous, which makes automatic interpretation and conversion to RDF or another suitable format difficult. For example, the table header cell “f (Hz)” refers to frequency measured in hertz, but the symbol “f” can also refer to the unit farad or to the quantities force or luminous flux. Current annotation tools for this task either work on less ambiguous data or perform a more limited task. We introduce new disambiguation strategies based on an ontology, which improve performance on “sloppy” datasets not yet targeted by existing systems.
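The “f (Hz)” example can be illustrated with a minimal sketch. It is not the authors' system: the toy ontology, the names `QUANTITIES`, `UNITS`, `candidates` and `interpret`, and the case-insensitive symbol matching are all assumptions made for illustration. The idea shown is only that a unit given in parentheses can rule out competing readings of an ambiguous symbol.

```python
import re

# Hypothetical toy ontology (an assumption, not the paper's ontology):
# quantity name -> (usual symbol, unit symbols it can be measured in).
QUANTITIES = {
    "frequency": ("f", {"Hz"}),
    "force": ("F", {"N"}),
    "luminous flux": ("F", {"lm"}),
}
# Unit symbol -> unit name.
UNITS = {"F": "farad", "Hz": "hertz", "N": "newton", "lm": "lumen"}

# Matches headers such as "f (Hz)" or a bare "f".
HEADER = re.compile(r"^\s*(?P<symbol>[^()\s]+)\s*(?:\((?P<unit>[^()]+)\))?\s*$")


def candidates(symbol):
    """Return every reading of a bare symbol, ignoring case (sloppy tables often do)."""
    s = symbol.lower()
    found = [("quantity", name) for name, (sym, _) in QUANTITIES.items() if sym.lower() == s]
    found += [("unit", name) for sym, name in UNITS.items() if sym.lower() == s]
    return found


def interpret(header):
    """Return the readings of a header cell that are consistent with its unit, if any."""
    match = HEADER.match(header)
    if match is None:
        return []
    readings = candidates(match.group("symbol"))
    unit = match.group("unit")
    if unit is None:
        return readings  # nothing to disambiguate with
    unit = unit.strip()
    # Keep only quantities that the toy ontology allows to be measured in this unit.
    return [
        (kind, name)
        for kind, name in readings
        if kind == "quantity" and unit in QUANTITIES[name][1]
    ]


print(interpret("f"))       # all four readings remain possible
print(interpret("f (Hz)"))  # only frequency is compatible with Hz
```

On its own, “f” keeps four candidate readings in this toy ontology; adding “(Hz)” leaves only frequency.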
Similar resources
Disambiguating Web Tables using Partial Data
This work addresses disambiguating Web tables annotating content cells with named entities and table columns with semantic type information. Contrary to state-of-the-art that builds features based on the entire table content, this work uses a method that starts by annotating table columns using automatically selected partial data (i.e., a sample), then using the type information to guide conten...
A Tool for Creating and Visualizing Semantic Annotations on Relational Tables
Semantically annotating content from relational tables on the Web is a crucial task towards realizing the vision of the Semantic Web. However, there is a lack of open source, user-friendly tools to facilitate this. This paper describes an extension of the TableMiner system, an open source Semantic Table Interpretation system that automatically annotates Web tables using Linked Data in an effect...
An XML-based Approach to Handling Tables in Documents
We explore application of XML technology for handling tables in legacy semistructured documents. Specifically, we analyze annotating heterogeneous documents containing tables to obtain a formalized XML Master document that improves traceability (hence easing verification and update) and enables manipulation using XSLT stylesheets. This approach is useful when table instances far outnumber disti...
A Framework for Annotating and Visualizing
Modeling without Borders: Creating and Annotating VCell Models Using the Web
Biological research is becoming increasingly complex and data-rich, with multiple public databases providing a variety of resources: hundreds of thousands of substances and interactions, hundreds of ready to use models, controlled terms for locations and reaction types, links to reference materials (data and/or publications), etc. Mathematical modeling can be used to integrate this complex data...
Publication date: 2010